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Abstract. We motivate and give semantics to theory presentation com- 
binators as the foundational building blocks for a scalable library of 
theories. The key observation is that the category of contexts and fibered 
categories are the ideal theoretical tools for this purpose. 



1 Introduction 

A mechanized mathematics system, to be useful, must possess a large 
library of mathematical knowledge, on top of sound foundations. While 
sound foundations contain many interesting intellectual challenges, build- 
ing a large library seems a daunting task because of its sheer volume. 
However, as has been well-documented [5,6,12], there is a tremendous 
amount of redundancy in existing libraries. 

Our aim is to build tools that allow library developers to take ad- 
vantage of all the commonalities in mathematics so as to build a large, 
rich library for end-users, whilst expending much less actual development 
effort. In other words, we continue with our approach of developing High 
Level Theories [4] through building a network of theories, by putting our 
previous experiments [5] on a sound theoretical basis. 

1.1 The Problem 

The problem which motivates this research is fairly simple: give devel- 
opers of mathematical libraries the foundational tools they need to take 
advantage of the inherent structure of mathematical theories, as first class 
mathematical objects in their own right. Figure 1 shows the type of struc- 
ture we are talking about: The presentation of the theory Semigroup 
strictly contains that of the theory Magma, and this information should 
not be duplicated. A further requirement is that we need to be able to 
selectively hide (and reveal) this structure from end- users. 
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The motivation for these tools should be obvi- 
ous, but let us nevertheless spell it out: we simply 
cannot afford to spend the human resources nec- 
essary (one estimate was 140 person-years [20]; [1] 
explore this topic in much greater depth) to de- 
velop yet another mathematical library. In fact, 
as we now know that there is a lot of structured 
redundancy in such libraries, it would be down- 
right foolish to not take full advantage of that. As 
a minor benefit, it can also help reduce errors in 
axiomatizations. 

The motivation for being able to selectively 
hide or reveal some of this structure is less straight- 
forward. It stems from our observation [4] that in 
practice, when mathematicians are using theories rather than developing 
news ones, they tend to work in a rather "flat" name space. An analogy: 
someone working in Group Theory will unconsciously assume the avail- 
ability of all concepts from a standard textbook, with their "usual" names 
and meanings. As their goal is to get some work done, whatever structure 
system builders have decided to use to construct their system should not 
leak into the application domain. They may not be aware of the existence 
of pointed semigroups, nor should that awareness be forced upon them. 
Some application domains rely on the "structure of theories", so we can 
allow those users to see it. 



Magma 
I 

Semigroup 
I 

Pointed Semigroup 
I 

Monoid 
I 

Group 
I 

Abelian Group 
Fig. 1. Theories 



1.2 Contributions 

To be explicit, our contributions include: 

— A variant of the category of contexts, over a dependently-typed type 
theory as the semantics for theory presentations. 

— A simple term language for building theories, using "classical" nomen- 
clature, even though our foundations are unabashedly categorical. 

— Using "tiny theories" to allow for maximal reuse and modularity. 

— Taking names seriously, since these are meant for human consumption. 
Moreover, we further emphasize that theory presentations are purely 
syntactic objects, which are meant to denote a semantic object. 

— Treating arrows seriously: while this is obvious from a categorical 
standpoint, it is nevertheless novel in this application. 

— Giving multiple (compatible) semantics to our language, which better 
capture the complete knowledge context of the terms. 
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1.3 Plan of paper 

We motivate our work with concrete examples in section 2. The theoretical 
foundations of our work, the fibered category of contexts, is presented in 
full detail in section 3. This allow us in section 4 to formalize the language 
of our motivation section, syntactically and semantically. We close with 
some discussion, related work and conclusions in sections 5-7. 

2 Motivation for Theory Presentation Combinators 

Let us compare the presentation of two simple theories: 

Monoid := Theory { 

U:type; *:(U,U) -> U; e :U; 

axiom r ight I dent i t y _ * _e : forall x:U. x*e = x; 

axiom 1 e f 1 1 d e n t i t y _ * _e : forall x:U. e*x = x; 

axiom associative.*: forall x,y,z:U. (x*y)*z = x*(y*z)} 

CommutativeMonoid := Theory { 
U:type; *:(U,U) -> U; e:U; 

axiom righ t Id e n t i t y _ * _e : forall x:U. x*e = x; 

axiom 1 e f 1 1 d e n t i t y _ * _e : forall x:U. e*x = x; 

axiom associative.*: forall x,y,z:U. (x*y)*z = x*(y*z); 

axiom commutative.*: forall x,y:U. x*y = y*x} 

They are identical, save for the commutative.* axiom, as expected. Given 
Monoid, it would be much more economical to define 

CommutativeMonoid := Monoid extended by { 

axiom commutative.*: forall x,y:U. x*y = y*x} 

and "expand" this definition, if necessary. Of course, given Group, we 
would similarly find ourselves writing 

CommutativeGroup := Group extended by { 

axiom commutative.*: forall x,y:U. x*y = y*x} 

which is also wasteful, as well as dangerous: is this "the same" axiom as 
before, or a different one? There is no real way to tell. It is natural to 
further extend our language with a facility that expresses this sharing. 
Taking a cue from previous work, we might want to say 

CommutativeGroup := combine CommutativeMonoid , Group over Monoid 

Informally, this can be read as saying that Group and CommutativeMonoid 
are both "extensions" of Monoid, and CommutativeGroup is formed by the 
union (amalgamated sum) of those extensions. Another frequent feature 
is renaming: an AbelianGroup, while isomorphic to a CommutativeGroup, 
is usually presented additively. We could express this as 
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AbelianGroup := CommutativeGroup [ * | — > +, o | — > ] 

Unfortunately, while this "works" to build a sizeable library (say of 
the order of 500 concepts) in a very economical way, it is quite brittle. 
Let us examine the reasons. It should be clear that by combine, we really 
mean pushout 1 . But a pushout is a 5-ary operation on 3 objects and 2 
arrows; our syntax gives the 3 objects and leaves the arrows implicit. This 
is a very serious mistake: these arrows are (in general) not easy to infer, 
especially in the presence of renaming. For example, there are two dis- 
tinct arrows from Monoid to Ring, with neither arrow being "better" than 
the other. Furthermore, we know that pushouts can also be regarded as a 
2-ary operation on arrows. In other words, even though our goal is to pro- 
duce theory presentations, our decision to use pushouts 2 as a fundamental 
building block gives us no choice but to take arrows seriously. 

So our task is now to find a category with "theory presentations" as 
objects, and with arrows which somehow express the notions of extending, 
combining and renaming as defined above. But before we explore that 
in depth, let us further examine our operations. First, there is nothing 
specific to CommutativeGroup in the renaming * i— > +, e i— > 0, this can be 
applied to any theory where the pairs (*, +) and (e, 0) have compatible 
signatures (including being undefined). Similarly, extend really defines a 
"construction" which can be applied whenever all the symbols used in 
the extension are defined. In other words, a reasonable semantics should 
associate a whole class of arrows 3 to these operations. 

But there is one more aspect to consider: in all our examples above, 
we have used short, meaningful names. While great for humans, they 
are in part at fault in the failure of being able to infer arrows. If, like 
in MMT [15], we used long names, might we be able to build a robust 
system? Maybe so, but it would immediately fall afoul of our second 
requirement: irrelevant information such as choices made by developers 
regarding the order in which to build theories, would leak into the long 
names, and thus be seen by users. Furthermore, when there is ambiguity, 
a long name system can indeed resolve that ambiguity, but at too high a 
cost to humans in absurdly long names for certain concepts. 

In other words, to be able to maintain human-readable names for all 
concepts, we will put the burden on the library developers to come up 
with a reasonable naming scheme, rather than to push that issue onto 
end users. Another way to see this is that symbol choice carries a lot of 

1 Following Burstall and Goguen [2] and Smith [17, 18] and many others since. 

2 which will in fact become pullbacks 

3 We are being deliberately vague here, Section 3 will make this precise. 
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intentional, as well as contextual, information which is commonly used 
in mathematical practice. Thus, to avoid leaking irrelevant information 
and to maintain intentional/contextual information, we will insist that 
on taking names seriously. 

3 Category of contexts 

We observe that theories from the previous section can all be specified 
as contexts of some dependent type theory. The work in this paper is 
abstract over the exact details of the dependent type theory, 4 so we simply 
assume that some dependent type theory is given. Following Cartmell [7], 
we form the category of contexts C of the given dependent type theory. 
The objects of C are contexts r that occur in judgements like r \- s : a 
of the dependent type theory. A context r consists of a sequence of pairs 
of labels and types (or kinds or propositions), 



holds (resp. : Kind, or : Prop). Contexts of dependent type theory can 
be used to define the types, operations, relations and axioms of a the- 
ory. We will use the abbreviation (x : c)q _1 for a context r, and , for 
concatenation of two such sequences. 

Example 1. We can define the theory of semigroups via 



where we use Haskell-style notation where (□) indicates (the name of) a 
binary function used infix in terms. 

Normally contexts are considered up to a-equivalence, that is, renam- 
ing or permuting the labels of a context makes no difference. But since 
labels do make a difference, we will not do so. However, a-equivalent 
terms and types continue to be considered equivalent. 

4 In fact, we expect this work to apply not only to dependent type theories, but to 
any classifying category [13]. 



r := (xq : ctq; . . . ; x n -i : cr n _i) , 



such that for each i < n the judgement 



(x : cr ; . . . ; Xj_i : <7j_i) h a { : Type 




U : Type 

(*):£/ x U -> U 
associative : Vx, y , z : U. (x * y) * z = x * (y * z) 
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Example 2. The signature for AdditiveSemigroup is given as the context 

/ U : Type 

/ (+):U xU ->U 

\ associative : Vx, y, z : U. (x + y) + z = x + (y + z) 

Traditionally Semigroup and AdditiveSemigroup would be considered 
the same context because they are a-equivalent. 

In the rest of this section, we will use the convention that r = 
(x : c)o _1 and A = (y : r}™^ 1 . Given two contexts r and A, a mor- 
phism r —7- A of C consists of an assignment [yo >— > to, . . . , y rn i— >■ i m _i], 
abbreviated as [y i— >■ t]™ 1 where the to, ... , t m -\ are terms such that 

r h to : t ... r h t m _i : r m _i [y ^ t]™" 2 

all hold, where r [y i— > t] l denotes the type r with the labels yo, ■ ■ ■ ,y% 
substituted by the corresponding terms of the assignment. We will also 
use n to denote concatenation of assignments, and \yf(j) H> t g (j)]*_ o for 
the "obvious" generalized assignment. 

Notice that an arrow from r to A is an assignment from the labels of 
A to terms in r. This definition of an arrow may seem backwards at first, 
but it is defined this way because arrows transform "models" of theories 
of r to "models" of theories of A. For example, every Abelian Semigroup 
is, or rather can be transformed into, an Additive Semigroup by simply 
forgetting that the Semigroup is Abelian. A later example 4 will give 
the explicit arrow from Abelian Semigroup to Additive Semigroup that 
captures this transformation. 

Let us fix V as the (countable) infinite set of labels used in contexts. 
If 7r : "V — > V is a permutation of labels, then we can define an action of 
this permutation on terms, types and contexts: 

7T • (x : cr)^ 1 := (it (x ) : tt ■ a ; ■ ■ ■ ;tt (s„_i) : tt ■ cr n _i) = {ttx : ir ■ a)^ 1 

where tt ■ U{ is the action induced on the (dependent) type <7j by renaming 
labels. The action of tt induces an endofunctor (tt ■ — ) : C — >■ C Further- 
more, each permutation tt : "V — > "V induces a natural transformation in 
I n : (tt ■ — ) =>- id<c where 

h (-0 := [XO >->■ TT (X ) X n _l (->■ TT (X n -l)] -.TT-T^r. 

We call an assignment of the form /„- (r) a renaming. Because permuta- 
tions are invertible, each renaming I n (r) : tt ■ r — > r is an isomorphism 
whose inverse is the renaming I w -i (tt ■ r) : r — > it ■ r. From this we can 
see that a-equivalent contexts are isomorphic. 
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Example 3. Let ir : "V — > "V be some permutation such that ir (U) = U, 
7r((*)) = (+), and ir (associative) = associative. By the definition of 
1^ (Semigroup) : AdditiveSemigroup — > Semigroup, we have that 

1-k (Semigroup) := [U >-¥ U; (*) i— > (+) ; associative i— >■ associative] 

is a renaming isomorphism between the contexts in examples 1 and 2. 

The category of nominal assignments, B, a sub-category of C will be 
quite important for use. For example, theorem 2 will show that IB is the 
base category of a fibration. 

Definition 1. The category of nominal assignments, B, has the same 
objects as <C, but only those morphisms whose terms are labels. 

Thus a morphism in B is an assignment of the form [m i-> ^a(i)]™ Q 1 such 
that the judgements 

rhx a(0) :r ... r h x o(m _i) : r m _i H- x a(i) ]™Q 2 

all hold. 

Definition 2. VFe define T to be a sub-context of T + if every element 
x : t of r occurs in r + . 

Definition 3. We call an assignment A : r — > A a diagonal assignment 
if A is of the form [y i-> y]g 1 (where A = (y : t)^ 1 ), denoted by 5a ■ 

r -> z\. 

Definition 4. assignment A : r + T is an extension u>/ien r is a 
sub-context of T + , and A is the diagonal assignment. 

Notice that an extension points from the extended context to the sub- 
context. This is the reverse from what Burstall and Goguen [2] use (and 
most of the algebraic specification community followed their lead). Our 
direction is inherited from <D, the category of contexts, which is later 
required by theorem 2 to satisfy the technical definition of a fibration. 

Example 4- Consider the theory AbelianSemigroup given as 

U : U : Type 
(+) : U x U -> U \ 
associative : Vx, y, z : U. (x + y) + z = x + (y + z) I 
commutative : Vx, y : U. (x + y) = (y + x) 

Then ^AdditiveSemigroup : AbelianSemigroup — > AdditiveSemigroup is an 

extension. 
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Example 5. Consider the following two distinct contexts (Ci,C2) for the 
theory of left unital Magmas with the order of their operators swapped: 



The diagonal assignment 5c x '■ C2 — > C\ is an extension (as is 5c 2 ■ Ci — > 



Notice that, for any given contexts E + and E, there exists an extension 
E + — > E if and only if E is a sub-context of E + . If E is a sub-context of 
E + then the diagonal assignment Sp '■ E + — > E is the unique extension. 

In general, a renaming I n : it ■ E — > E will not be an extension unless 
-/r is the identity on the labels from E. In our work, both renaming and 
extentions are used together, so we want to consider a broader class of 
nominal assignments that include both extensions and renamings. 

Definition 5. Those nominal assignments where every label occurs at 
most once will be called general extensions. 

We see that for every permutation of labels 7r : "V —> V and every 
context E that I w (E) : it ■ r — >■ r is a general extension (and hence also 
a nominal assignment). 

Theorem 1. Every general extension A : r + — > A can be turned into an 
extension by composing it with an appropriate renaming. 

The proof of this theorem, along with all other theorems, lemmas and 
corollaries in this section can be found in Appendix A. 

Corollary 1. Every general extension A : T + — > A can be decomposed 
into an extension A e : E + — > E followed by a renaming A r : E — >■ A. 

These general extensions form a category which plays an important role. 



U 



Type 



U 



Type 




c 2 ). 



E+ 




A+ 



Definition 6. The category of general extensions 
IE has all general extensions from B as objects, and 
given two general extensions A : E + — > E and B : 
A + — >■ A, an arrow f : A — > B is a commutative 
square from B. We will denote this commutative 
square by (f + , f~) : A — >■ B. 



A 



B 



E 



A 



f 
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We remind the reader of the usual convention in category theory where ar- 
rows include their domain and codomain as part of their structure (which 
we implicitly use in the definition above). 

Lemma 1. Every general extension is isomorphic in IE to an extension 
B : r° — > r where r is an initial segment of T°. 

This category of general extensions IE is fibered over the category 
B by the codomain functor cod : E — > B. Given general extensions 
A : r + ->■ r and B : A + -> A and a morphism (/+, /~) : A ->• B in E 
we have 

cod(A):=r cod (/):=/- 

Theorem 2. The functor cod : E — > B is a fibration. 

Corollary 2. Given u : r — > A, a general extension A : A + — > A, and 

a cartesian lifting u {A) : u* (A) A, if u is a general extension, then 
u(A) + is also a general extension. 

Example 6. The nominal assignment (and general extension) 

: AbelianSemigroup — > Semigroup 



u :- 



(*) (+) 



associative i— >■ associative 



and the extension A := ^semigroup : Monoid — > Semigroup induce the 
existence (via theorem 2) of some Cartesian lifting u (A) : u* (A) — > A in 
E. One example of such a Cartesian lifting for u is 



AbelianMonoid 
AbelianSemigroup 



u(Ay 



Monoid 



Semigroup 



where AbelianMonoid is 

U : Type 
: U 

I (+):U xU 

( rightldentity : Vx : U.x + = x 
\ leftldentity : Vx : £7.0 + x = x 

associative : Vx, y, z : U. (x + y) + z = x + (y + z) 
commutative : Vx, y : U. (x + y) = (y + x) 
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and u* (A) : AbelianMonoid — > AbelianSemigroup is the diagonal as- 
signment, where u {A) + : AbelianMonoid — > Monoid is 



u(A) + :-- 



(*) -> (+) 
rightldentity h-» rightldentity 

leftldentity i-> leftldentity 

associative i-> associative 



In almost all of the develop- 
ment of the algebraic hierarchy, 
the nominal assignments that 
we use are all general exten- 
sions. However, it is important 
to note that the definition of a 
Cartesian lifting requires nom- 
inal assignments that are not 
necessarily general extensions, 
even if all the inputs are general 
extensions. 

Consider the simple case 
(pictured above) where u : (U : Type) — > () is the unique exten- 
sion, and a Cartesian lifting of u over itself. The mediating arrow for 
id : (U : Type) — > (U : Type) and itself must be 

/ : (U : Type) -> (U : Type; U' : Type) 
f:=[U*-> U, U' ' y U] 




(U : Type) 



u 







which is not a general extension. 



4 Semantics of Theory Presentation Combinators 

Like in the previous section, we will assume that we have a background 
type theory with well-formedness judgments, which defines four differ- 
ent sorts, namely (Type, Term, Kind, Prop). The symbols used in the type 
theory itself will be called labels, whereas the symbols used for theory 
presentations will be called names. As above, a i-> b denotes a substitu- 
tion. Using this, we can define the formal syntax for our combinators as 
follows. 
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a, b, c G labels 
A, B, C G names 

I G judgments* 
r G (aj h-> 



r G Type 
fc G Kind 
t G Term 
G Prop 



tpc ::= extend A by {/} 

combine A n, B r 2 
A ; B 
A r 
Empty 
Theory {/} 

Intuitively, the six forms correspond to: extending a theory with new 
knowledge, combining two theories into a larger one, sequential compo- 
sition of theories, renaming, a constant for the Empty theory, and an 
explicit theory. 

What we do next is slightly unusual: rather than give a single deno- 
tational semantics, we will give two, one in terms of objects of B, and 
one in terms of objects of IE (which are special arrows in B). In fact, we 
have a third semantics, in terms of (partial) Functors over the contextual 
category, but we will omit it for lack of space. First, we give the semantics 
in terms of objects of B, where [— } n is the (obvious) semantics in "V — > V 
of a renaming. 



[EmptyJjB 
[Theory {1}] M 

{A r] B 
[A; B}n 
[extend A by {7}]b 
[combine A 1 r 1 ,A 2 r 2 ]]B 



D > A x 



Sa 



A, 



-> A 



tpc |B| 


(I) 

Irh ■ Mb 
= IBU 

Mb? (1) - 5 A 

D 

where D comes from the (potential) pullback diagram on the right, in 
which we omit [— Jb around the As for clarity. We use = to abbreviate 
"when the rhs is a well-formed context". For the semantics of combine, it 
must be the case where the diagram at right is a pullback (in B), where 
A is the greatest lower bound context [vIiJb n [^Jb- Furthermore [rifl^ 
and [^Jtt must leave A invariant. We remind the reader of the require- 
ment for these renamings: the users must pick which cartesian lifting they 
want, and this cannot be done automatically (as demonstrated at the end 
of last section) . 

The second semantics, is in terms of the objects of IE, in other words, 
the special arrows of B, as defined in Section 3. 
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[-] E : tpc^ |E| 
[EmptyJ E = id () 
[Theory {/}] E * ! (0 

[Ar] E = [r] ff -[A] E 
[A;£] E = [A] E o [B] E 
[extend ,4 by {Z}J E <5 A 
[combine Ain, A 2 r 2 ] E = [n] w o 5 Tl o [Ai] E 

= Hx«*ibo[A 2 ] E 

The diagram on the right has to be verified to be a pullback diagram 
(this is why the semantics is partial here too). Here we assume [Ai] E € 
Hom(Ti,T) and [A 2 ] E G Hom(T 2 ,T), and that both [n] w and [r^ 
leave T invariant. 

Theorem 3. For all tpc terms except combine, [sJb = dom[sJ E . WTien 
s = combine A 1 r 1 ,A 2 r 2 , if cod ([AiJ E ) = cod ([A 2 ] E ) = [^iIb n [A 2 ] B , 
and neither arrows [Ai] E nor [A 2 ] E involve renamings, then [sJb = 
dom[s]] E in that case as well. 

The proof is a straightforward comparison of the semantic equations. 
This theorem basically says that, as long as we only use combine on the 
"natural" base of two arrows which are pure extensions, our semantics 
are compatible. In a tiny theories setting, this can be arranged. 

5 Discussion 

It is important to note that we are essentially parametric in the underlying 
type theory. This should allow us to be able to generalize our work in ways 
similar to Kohlhase and Rabe's MMT [15]. 

The careful reader might have notice that in the syntax of section 2, 
our combine had an over keyword. This allowed our previous implemen- 
tation [5] to come partway to the IE semantics above. This is a straight- 
forward extension to the semantics: [combine Airi,A 2 r 2 over CJb would 
replace A = [Ai]b n [A 2 J E with [CJb, with corresponding adjustments 
to the rest of the pushout diagram. For [— ] E , one would insist that 
cod ([Ax] E ) = cod ([A 2 ] E ) = [C] B . 



D > Ti 



Ai 



T 2 



Ao 
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What is more promising 5 still is that most of our terms can also 
be interpreted as Functors between fibered categories. This gives us a 
semantic for each term as a "construction", which can be reused (as in 
our example with commutativity in section 2). Furthermore, since fibered 
categories interact well will limits and colimits, we should also be able to 
combine constructions and diagrams so as to fruitfully capture further 
structure in theory hierarchies. 

It should also be noted that our work also extends without difficulty 
to having definitions (and other such conservative extensions) in our con- 
texts. This is especially useful when transporting theorems from one set- 
ting to another, as is done when using the "Little Theories" method [9]. 
We also expect our work to extend to allow Cartesian liftings of extensions 
over arbitrary assignments (aka views) from the full category of contexts. 

Lastly, we have implemented a "flattener" for our semantics, which ba- 
sically turns a presentation A into a flat presentation Theoryj 1} by com- 
puting cod ([A[]e). This fulfils our second requirement, where the method 
of construction of a theory is invisible to users of flat theories. 

6 Related Work 

We will not consider work in universal algebra, institutions or categorical 
logic as "related" , since they employ a-equivalence on labels (as well as on 
bound variables), which we consider un- helpful for theory presentations 
meant for human consumption. We also leave aside much interesting work 
on dependent record types (which we use), as these are but one imple- 
mentation method for theories, and we consider contexts as a much more 
fundamental object. 

We have been highly influenced by the early work of Burstall and 
Goguen [2, 3], and Doug Smith's Specware [17, 18]. They gave us the basic 
semantic tools we needed. But we quickly found out, much to our dismay, 
that neither of these approaches seemed to scale very well. Later, we were 
hopeful that CASL [8] might work for us, but then found that their own 
base library was improperly factored and full of redundancies. Of the vast 
algebraic specification literature around this topic, we want to single out 
the work of Oriat [14] on isomorphism of specification graphs as capturing 
similar ideas to ours on extreme modularity. And it cannot be emphasized 
enough how crucial Bart Jacob's book [13] has been to our work. 

From the mathematical knowledge management side, it should be 
clear that MMT [15] is closely related. The main differences are that they 



Work in progress 
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are quite explicit about being foundations-independent (it is implicit in 
our work), they use long names, and their theory operations are mostly 
theory-internal, while ours are external. This makes a big difference, as it 
allows us to have multiple semantics, while theirs has to be fixed. And, of 
course, the work presented in the current paper covers just a small part 
of the vast scope of MMT. 

There are many published techniques and implementations of alge- 
braic hierarchies in dependently typed proof assistants including [11, 
19, 10, 16]. Our work does not compete with these implementations, but 
rather complements them. More specifically, we envision our work as a 
meta-language which can be used to specify algebraic hierarchies, which 
can subsequently be implemented by using any of the aforementioned 
techniques. In particular we note that maintaining the correct structures 
for packed-classes of [10] is particularly difficult, and deriving the re- 
quired structures from a hierarchy specification would alleviate much of 
this burden. Other cited work, (for example [16]) focus on other difficult 
problems such as usability, via providing coercions and unification hints 
to match particular terms to theories. Even though some similar tech- 
niques (categorical pullbacks) are used in a similar context, the details 
are very different. 

7 Conclusion 

There has been a lot of work done in mathematics to give structure to 
mathematical theories, first via universal algebra, then via category the- 
ory (e.g. Lawvere theories). But even though a lot of this work started 
out being somewhat syntactic, very quickly it became mostly semantic, 
and thus largely useless for the purposes of concrete implementations. 

We make the observation that, with a rich enough type theory, we 
can identify the category of theory presentations with the opposite of the 
category of contexts. This allows us to draw freely from developments 
in categorical logic, as well as to continue to be inspired by algebraic 
specifications. Interestingly, key here is to make the opposite choice as 
Goguen's in two ways: our base language is firmly higher-order, while our 
"module" language is first-order, and we work in the opposite category. 

We provide a simple-to-understand term language of "theory expres- 
sion combinators" , along with multiple (categorical) semantics. We have 
shown that these fit our requirements of allowing to capture mathematical 
structure, while also allowing this structure to be hidden from users. 
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Even more promising, our use of very standard categorical construc- 
tions points the way to simple generalizations which should allow us to 
capture even more structure, without having to rewrite our library. Fur- 
thermore, as we are independent of the details of the type theory, this 
structure seems very robust, and our combinators should thus port easily 
to other systems. 
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A Proofs 

Given a nominal assignment of the form i-> 1^]™^, we call the func- 
tion a : N|^| — > N|p| above an indexing function. The indexing function 
of a nominal assignment is sufficient to specify the nominal assignment. 
Given a morphism A : r — > A with indexing function a : — >• 1N\r\ 
and a morphism B : A — >■ E with indexing function b : N|=| —> N|^|, that 
the composition B o A : r — > E has a o b : N| S | — > N|_p| for its indexing 
function. 

Proof (of Theorem 1, Section 3). Suppose A = [yi i-> x a ^Y"_Q ■ Because 
a is injective for general extensions, we can select a permutation of names 
ir a '■ V —¥ "V such that tta (yi) = x a (i)- This implies that tta • A is a sub- 
context of r + . Consider the renaming I -i (tta • A) : A — > tta • A. We 

A 

see that the composition I -i (tta ■ A) o A: r + — v tta • A is an extension 

n A 

since 

^-i (tta ■ A) o A = [tt a (Vi) yi}™^ 1 o [ yi ^ x a(i) ]™~ 1 

= [VTA (Vi) ^ ZawE 1 = t X «W ^ ^WI^O 1 

is the diagonal assignment. □ 

Proof (of Corollary 1, Section 3). From the previous theorem we know 
that there is a permutation 71-^4 : "V — >■ "V such that (tta ■ A) o A : 
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r + — y ir a ■ A is an extension. Let r := it a • A and let A e := I -1 (7ta • ^) ° 
A. Then we can take A r := I^ A (A) : r — > A. We immediately see that 

A r oA e = I nA (A) o Z^-i (ir A ■ A)oA 

= I nA (A)o(I WA (A))- 1 oA = A. 

□ 

Proof (of Lemma 1, Section 3). By Theorem 1 every general extension 
is isomorphic to an extension, so it suffices to show that every extension 
A : r + — > r is isomorphic to an extension B : r° — >■ r where r is an 
initial segment of r°. If r is already an initial segment of r + , then we 
can just take r° := r + . Otherwise there exist i and j such that i < j < s 
and 

r — (X(J . To, . . . , Xj . Tj, . . . , Xj . Tj , . . . , X s _l . T s _l) 

where x«_i is not in r, x«, . . . , Xj_i are all in r, and Xj, . . . , x s _i are all 
not in r. Because is a well formed context, it must be the case that 
Xj_i does not occur in Tj, . . . , Tj-i, and we can safely rearrange r + into 

T+ = (x : r ; . . . ; Xj_ 2 : Tj_ 2 , %i ■ n; . . . ; Xj_i : Tj_i, 

Xj— 1 . Ti—\,Xj . Tj, . . . , X s _x . T s _l). 

This new context -T + is isomorphic to r + via 5p+ : r + — > F + . Continuing 
by induction on i, we can eventually construct a r° which is isomorphic to 
r + via 5 r + : r° — > F + such that I 1 is an initial segment of r° . Therefore 
A : r + — > r and A o 5 r + :r°->r are isomorphic in E. □ 

Proof, (of Theorem 2, Section 3) To prove that cod : E — > B is a fibration 
we need to show that for any nominal assignment u : r — > A from B 
and a general extension A : A + — > A, there exists a Cartesian lifting 
u (A) : u* (A) -> ^ in E. 

We need to show that u (A) is a pullback diagram of the cospan 

r A i— A + in B and that u* (A) is a general extension. By Lemma 1 
we can assume that A is an extension and A is an initial segment of A + . 
We can further suppose that the names in A + are disjoint from the names 
r, by applying a suitable permutation it to A and noting that if A is an 
initial segment of A + then it ■ A is an initial segment of it ■ A + . 

Suppose A + = (y : a) r Q +e ~ l and u := [j/j i-> x a (j)]™ • Define an 
extension of T, T+ := (x : t)q +6=1 , where x n+fc := y m+k and r n+fc := 
o-m+fc [yi ^ Xa(i)]™L for a11 k < e. Let u* (A) := <5 r : T+ ->• T. By 
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definition u* (A) is an extension (and r is an initial segment of r + ). 
We can define a nominal assignment u + (A) : r + —> A + by u + (A) := 

[Vi ^ x a(i)]^Lo N t ym+i ^ x n+i\iZo- % construction, u o u* (A) = A o 
u + (A), and so tZ (.A) := u) : u* (A) — > A is a commutative square. 

To show that u (A) is a Cartesian lifting of u over A, consider some 
X : S + —7- H in IE along with g : X —> A and u : S — > J 1 such that 
uoti = cod (<?). Again, without loss of generality, we can assume that X 
is an extension and £ is an initial segment of S + . 



Suppose E = (z : £) 



r-l 



[yi ^ ^c(i)] T= V 1 - We can define 



r+s— 1 



n-1 



3=0 



r + as u x [xj i-> z c (j)] 



,<T = 

m+e— 1 

j=m 
m— 1 



To see that v + is well formed, recall that r ra +fc = cr ra +fc [yi h-> aJ a (i)]j. (l 
Because for all i < m, c (i) = b (a (i)), 



T n+k [v + ] = T n+k V X [Xj Z c(j -)]" 



1 m+e— 1 



r -im+fe-1 




case that for all i < n, d (i) 
be the case that for all k < e, d (n + k) 
and so w = v. 



which is well typed in S because g + is well- 
defined. Clearly, u* (A) o v + = v o X, and 
g+ = u + {A) o v + by construction. There- 
fore, v := v ) : X — > u* {A) is an arrow 
in IE such that uov = g. 

Finally, we need to show that v is the 
unique arrow in IE such that u o v = g and 
cod (v) = v. Suppose w : X — > u* {A) 
is another arrow such that u o w = g and 
cod (w) = v. Say w + := [xj ^ Zd(j)] J=o ■ 
Because u* (A) ow + = X ov, it must be the 
= b(i). Because u + (A) o w + = g + , it must 



c{m + k). Therefore, w 



□ 



Proof (of Corollary 2, Section 3). For such a u, a cartesian lifting A (u) : 
A* (u) —7- u is isomorphic to the transpose of u(A), and in particular 
■u (A) + will be isomorphic to A* (u). Since A* (u) is a general extension, 
then so is u (A) + . □ 



